(12) 



UK Patent Application „„ GB o„2 257 273„3,A 

(43) Date of A publication 06.01.1993 



(21) Application No 9212516.0 

(22) Date of filing 12.06.1992 



(30) Priori^ data 
(31) 0772200S 



(32) 27.06.1991 (33) US 



(71) Applicant 

Digital Equipment Corporation 

(Incorporated in tiie USA • Massachusetts) 

146 Main Street, Maynard, Massachusetts 01754, 
United States of America 

(72) Inventors 

Christian David Saether 
Peter StoppanI Jnr 

(74) Agent and/or Address for Service 
Kllburn & Strode 

30 John Street, London, WC1N 2DD, United Kingdom 



(51) iNTCL» 

G06F 12/02 

(52) UKCL (Edition L) 

G4A AMX 

(56) Documents cited 
None 

(58) Field of search 

UKCL (Edition K) G4A AFN AMX 
INTCL« G06F 12/02 15/40 
Online database: WPI 



(54) Data storage system with device Independent file directories 



(57) A computer file system 100, having a multiplicity of distinct dis!< Storage devices 108, 110, 112, includes a muitiplicHy of 
file directories 152, stored on various disks. Each file directory is used to translate file names into corresponding tag values. 
For each disk there is a file descriptor table 154 with a file descriptor entry for every file stored on the disk. A single tag 
directory 156 contains one tag entry for every file stored in the system. The tag directory is used by the file system to find a 
file by translating a tag value into a pointer to point to the disk on which the file Is stored and a pointer indicating the file's file 
descriptor entry. To move a file from a first disk to a second disk, the file is copied to the second disk, a new file descriptor 
entry for the copied file is generated in the file descriptor table 154 for the second disk, the copy of the file on the first disk is 
de-aiiocated, and the tag entry for the file is updated to point to the second disk and to the file's new file descriptor entry. 
Thus, a file can be moved from a first disk a second without having to locate and update all the corresponding file directory 
entries. Also the file system includes a routine that monitors disk loading and unused disk capacity. It detemiines when disk 
usage is imbalanced and automatically moves files among the disks so as to better balance disk usage. 
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DATA STORAGE SYSTEM AND METHOD 
WITH DEVICE INDEPENDENT FILE DIRECTORIES 

The present invention relates generally to data storage management on computer 
systems having multiple storage devices, and particularly to a file directory system 
and method for enabling files to be redistributed among the computer system's 
storage devices in a manner that Is transparent to the computer system's users. 

5 

Large, commercial computer systems (sometimes called data centers) typically 
have many types of data storage devices. Such data storage devices include 
10 solid state disks (RAM disks), cheap and slow magnetic disks, expensive and 
fast magnetic disks, striped disks, shadowed disks, optical disks, tape drives, 
and so on. 

When a file is created In a computer system having multiple disks, the computer's 
15 file system selects a disk on which to store the file. In prior art systems, once 

a file is created and stored on one of the file system's disks, It Is permanently 

bound to that disk until it Is deleted. In other words, the file is device-dependent. 

This does not efficiently utilize the file system's disks. For example, some disks 

will be come full while others are nearly empty. The system administrator must 
20 take explicit and disruptive actions to redistribute the files among the system's 

disks. This may require copying (and renaming) files, or re-lnitiallzing the file 

system. 
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As another example of a problem associated with the prior art file systems, some 
disks can become very active while others have almost no activity* This happens 
when the most commonly used files are allocated to one or a few disks in the 
file system, while the remaining disks contain old and Infrequently used files. 
5 Again the system administrator must take explicit and disruptive actions to 
redistribute the files among the system's disks. 

Current UNIX~ file systems can span, or support, only a single volume, or even 
just a portion of one volume. A volume can be one or more physical disks, but 
1 0 the file system treats a multi-disk volume as a single large disk. Therefore, UNIX 
file systems technically span only one disk, which means that UNIX file systems 
utilize disk storage just as poorly, if not worse than, file systems that span multiple 
disks and allocate files in a device-dependent manner. 

1 5 The present invention solves these problems by allowing a file system to easily, 
transparently (i.e., users and applications are not aware of file movement) and 
automatically redistribute files among its disks. To transparently move files 
among several disks, the file system of the present invention allocates files in 
a device-independent manner. 

20 

The present invention is defined in the accompanying 
claims to which reference should now be made. 

The present invention in its broad form resides in a method and a file system 
for selective redistribution of files for use with a computer system having a central 

25 processing unit and a multiplicity of distinct data storage devices on which files 
are stored, comprising: a multiplicity of file directories, stored on various ones 
of said multiplicity of data storage devices, each of which contains a set of file 
directory entries, each file directory entry denoting a file name and a 
corresponding tag value; wherein multiple file names corresponding to a single 

30 file may be stored in multiple ones of said file directories; a separate file 
descriptor table for each of said multiplicity of data storage devices, each file 
descriptor table containing a file descriptor entry for every file stored on the 



-3. 



corresponding data storage device; a tag directory, stored on a selected one 
of said multiplicity of data storage devices, containing tag entries for a defined 
set of tag values, wherein every file stored in said multiplicity of data storage 
devices is represented by a unique one of said tag entries, each said tag entry 
5 denoting the one of said multiplicity of dgita storage devices on which the file 
corresponding to said tag entry is stored; and file moving means, operable by 
said central processing unit, for moving a specified file from a first one to a 
second one of said multiplicity of data storage devices, including means for 
copying said specified file to said second one of said multiplicity of data storage 

1 0 devices, generating a file descriptor entry for said copied file on said file descriptor 
table for said second one of said multiplicity of data storage devices, and 
changing the tag entry in said tag directory corresponding to said specified file 
so that said tag entry denotes said second one of said multiplicity of data storage 
devices; whereby a file can be moved from a first one to a second one of 

15 said multiplicity of data storage devices without having to locate and update all 
corresponding file directory entries. 

A preferred embodiment provides a file system for a computer having a multiplicity 
of distinct disk storage devices. The file system includes a multiplicity of file 
20 directories, stored on various disks. Each file directory Is used to translate file 
names into corresponding tag values. For each disk there is a file descriptor 
table with a file descriptor entry for every file stored on the disk. A single tag 
directory contains one tag entry for every file stored in the system. 

25 A tag directory Is used by the file system to find a file by translating a tag value 
Into a pointer to the disk on which the file is stored and a pointer to the file's file 
descriptor entry. To move a file from a first disk to a second disk, the file is 
copied to the second disk, a new file descriptor entry for the copied file Is 
generated in the file descriptor table for the second disk, the copy of the tile on 

30 the first disk and Its file descriptor entry are de-allocated, and the tag entry for 
the file is updated to point to the second disk and to the file's new file descriptor 
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entry. Thus, a file can be moved from a first disk to a second without having 
to locate and update all the corresponding file directory entries. 

In a preferred embodiment, the file system includes a routine that monitors disk 
5 loading and unused disk capacity. It determines when disk usage is imbalanced 
and automatically moves files among the disks so as to better balance disk 
usage. 

10 - . 

The present invention can be put into practice in various 
ways one of which will now be described by way of example 
with reference to the accompanying drawings in which: 

15 Figure 1 Is a block diagram of a computer system with multiple data storage 
devices and a file management system ; 

Figure 2 is a conceptual block diagram of a prior art, single-level file lookup 
system ; 

20 

Figure 3 is a conceptual block diagram of a two-level file lookup system in 
accordance with a preferred embodiment of the present invention ; 

Figure 4 Is a flow chart depicting movement of a file from one disk to another 
25 disk in accordance with a preferred embodiment of the present invention ; 

Figure 5 is a block diagram of the tag directory data structure ; and . 

Figure 6 is a flow chart of a disk us^ge monitor and balancing routine. 

30 



Referring to Figure 1» there is shown a computer system 100 having a central 
processing unit 102 which is interconnected by system bus 104 to secondary 
memory 106 (which comprises magnetic disk storage devices 108-1 12), primary 
memory 1 20 (i.e., high speed, random access memory), virtual memory manager 
124, and one or more user interfaces 126, Stored In primary memory 120 are 
currently executing application programs 130, as well as operating system 
software, such as the computer's file system 140 (sometimes called the file 
manager program). 

Data structures 150 concerning disk storage that are maintained by the file 
system 1 40 Include numerous file directory tables 1 52, file descriptor tables 1 54 
(one per logical disk volume), and a tag directory 156, which is a new data 
structure introduced by the present invention. The data stmctures 1 50 are stored 
in secondary memory 106. In particular, the system can contain a virtually 
unlimited number of file directories 152, which tend to be stored more or less 
randomly throughout the disks of secondary memory. RIe directories 152 are 
created by the system's users, at their convenience, for grouping sets of disk 
files. Users access existing files by referencing a file name in a specified file 
directory 152. The user's request is passed to the file system 140, which 
determines where in secondary storage 1 06 the specified file is located and then 
either opens the file or performs some other specified task, such as making a 
copy of the file, printing the file, deleting the file, etc. 

As will be described below, each disk 108-112 has a file descriptor table 154 
containing an entry for each file stored on that disk. Finally, there is just one 
tag directory 1 56 in the system, which is usually stored on a single disk, although 
in one preferred embodiment there is a "secondary" portion of the tag directory 
156 whicii can be split off and stored elsewhere. 



The file system 140 includes a modified set of routines 160 for such normal file 
system tasks as allocating disk space to files, opening and closing existing files, 
and so on. In addition, the file system 140 of the present invention includes a 
file moving program 162, and a "disk usage monitor and balancing program" 1 64. 
The file moving program 162 moves a specified file from one disk to another. 
The usage monitor and balancing program 1 64, which will be described in more 
detail below, monitors the fullness of the system's disk storage devices 1 06-1 1 0 
and their relative levels of disk input/output activity. Based on the observed disk 
usage, imbalances in usage that adversely affect system performance are 
automatically corrected by moving files from one disk storage device to another, 
transparently to the users of the system 100. 

Single Level File Lookup 

Referring to Figure 2, most current file systems use a single-level file lookup 
mechanism for locating files In a file system. These file systems use file 
directories 152 to translate file names to a unique file identifier or disk address, 
which can be used to locate the file's descriptor 170 (also called a file header 
or file descriptor entry) in a table called the file descriptor table 1 54. Each disk 
or logical disk volume in the system contains its own file descriptor table 154, 
which is stored on disk. The file descriptor table 154 stores file descriptor entries 
1 70 which describe each file stored on that particular disk or logical disk volume. 
Each file descriptor entry 170 defines a file's attributes and contains a storage 
map or its equivalent 

Note that a "logical disk volume" may contain more than one physical disk, but 
Is treated by the computer system for memory storage purposes as a single disk 
drive. For the purposes of this document, the terms "disk" and "disk storage 
device" shall mean any logical disk volume, regardless of whether that logical 
volume is one physical disk or many. 

As shown in Rgure 2, two or more application programs 172-176, typically 
running in distinct processes, can have access to a single file 178 either at the 



same time or at different times. Furthermore, each program or process can use 
a different name for the same file. For each such file name, there is a distinct 
directory item 180 in one of the computer system's file directories 152. However, 
there is only one file descriptor 170 for each file, and thus two or more file 
directory entries may point to the same fjle descriptor 170. Such file sharing 
is quite common. 

A problem with the single level lookup mechanism shown in Rgure 2 is that it 
makes it difficult to move a file to a different disk. There are two ways that such 
a file system could move a file. In the first method, It would allocate a new file 
descriptor on the new disk and copy the file and the contents of the old file 
descriptor 170 to the new disk. Then the file system would delete the old file 
descriptor and update the file directory 152 on the old disk so that It points to 
the new descriptor. The problem with this approach is that many file systems 
support multiple directory entries per file, as shown in Figure 2. This means that 
the file system would either have to exhaustively search all the file directories 
152 In the entire computer system for all possible entries that belong to the 
moved file, which is simply not viable in large computer systems that have 
hundreds or thousands of file directories, or it would somehow have to keep track 
of all directory entries 180 for each file, which is a nontrivial problem. 

A second method of moving a file to another disk, starts out with the same initial 
step: allocate a new file descriptor oh the new disk and copy the file and the 
contents of the old file descriptor 170 to the new disk. Then, the file system 
would modify the old file descriptor 170 so that it points to the new disk. The 
problem with this approach is that it clutters the system's disks with "forwarding 
address" file descriptors. In other words, the file descriptor table 154 for one 
disk may end up containing many entries 1 70 which contain fon^^arding addresses 
to other disks. This wastes disk storage space and makes It nearly impossible 
to remove a disk from the file system, because the file descriptor entries 170 
with forwarding addresses are needed to locate all the files that were formerly 
stored on the disk. 



Thus» one of the major deficiencies of current file systems is that they 
permanently bind a file to a particular disk until the file is deleted. 

Two Level Hie Lookup 

Referring to Figure 3. the file system 140 of the present invention adds another 
level into the file lookup mechanism so that the file system can support 
device-independent files. The preferred embodiment does this by adding a Tag 
Directory between the File Directory and the File Descriptor Table. 

The file system 140 uses modified file directories 202-1, 202-2, with modified 
file directory entries 204, to translate a file name into an identifier herein called 
a tag, which points to a tag entry 208 in tag directory 156. In other words, the 
file directory entries 204 in the present invention contain a tag value instead of 
a pointer to a file descriptor entry. 

There is only one tag directory 156 for the entire file system 140, regardless of 
the number of disks in the file system. The location of the tag directory 156 in 
secondary storage is maintained by the file system 140. If a file has multiple 
directory entries 204, then all the directory entries contain the same tag value. 
The file system uses the tag directory 156 to translate the tag value (found in 
the file directories) into a disk address for an entry into the descriptor table 154 
on a specified disk. This translation is performed simply by finding the tag entry 
208 corresponding to the specified tag value, and then retrieving the disk identifier 
and file descriptor Index value stored in that tag entry 208. Thus, each tag entry 
208 Identifies both the disk on which the Tile 1s stored, and references a fiie 
descriptor 170 which specifically locates the file on that disk. 

Referring to Figure 4, using the above described lookup mechanism, the file 
system's file moving routine 1 62 moves a specified file to another disk using the 
following steps {not necessarily in this order): 

A. Allocate a new file descriptor entry on the new disk, Le., the disk to which 
the file is being moved (step 210). 



B. Copy the file and the file descriptor entry from the old disk to the new disk 
(step 212), 

C. De-allocate the old file descriptor entry and the file's disk space (step 214). 

D. Update the file's tag entry so that it points to the new file descriptor entry 
(step 21 6). 

Typically, as In any large computer system, transactional log entries will be 
created by the file system during this process so that the system can recover, 
without data loss, from a system crash during any point of the above described 
file moving process. Such transaction recovery techniques are not part of the 
present invention and are well known to those skilled in the art. 

The tag directory 156 mechanism of the present invention provides a single item 
that needs to be updated when a file is moved, as opposed to updating multiple 
file directory entries as required by previous file systems. Also, If the disk that 
contains the tag directory 156 needs to be removed from the file system, or if 
the tag directory 1 56 needs to be relocated for any other reason, the tag directory 
156 can be moved to another disk. The tag directory is simply copied to a new 
disk, and the tag directory location (maintained by the file system) is updated 
accordingly. 

Referring to Figure 5, the tag directory 1 56 is organized as an array 230 of pages 
232, where each page 232 contains ai header (not shown) and an array of tag 
pointers. Logically, the tag directory 156 is simply an array of tag entries where 
each tag entry represents a tag and entries are sorted in ascending order, starting 
with tag 1. Tags are allocated In ascending order and they are never reused 
(I.e., once a file is deleted. Its tag is not reused). 

The simple organization of the tag directory facilitates fast and efficient tag lookup. 
Given a tag to look up, one only needs to calculate the tag's page number and 
the tag's position within the page. The following formulas are used: 
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TAG'S^PAGE = INTEGER { (TAG-1)>TAGS_PER_PAGE) } 
TAG'SJNDEX = (TAG-1) - TAG'S PAGE x TAGS_PER_PAGE 

As files are deleted, their associated tags become invalid. Since tags are never 
5 reused, tag directory pages will, over time, contain some percentage of invalid 
entries that can never be reused. Eventually, some pages of the tag directory 
1 56 will contain very few active tag entries. While pages with no valid tag entries 
can simply be deleted, pages with just a few active entries waste disk space 
for a very long period of time if the remaining tag entries belong to files that aren't 
10 deleted. 

In one preferred embodiment, whenever a tag directory page 230 contains less 
than a specified number of valid entries (e.g., less than five valid entries for pages 
that hold one thousand tag entries), the remaining valid tag entries are copied 

15 to a secondary tag directory 240 and the tag directory page 230 is de-allocated. 
There is only one secondary tag directory 240 in the file system, and it is 
implemented as a dense ordered array of tag entries. In other words, there is 
little or no unused space in the secondary tag directory 240, avoiding wasted 
disk space. This can be done either by storing the tags in directory 240 in strict 

20 ascending order and using a binary search to find items, or by using a hash table 
to quickly locate items in the secondary tag directory 240. The secondary tag 
directory 240 is accessed only when a tag is not found in the primary tag directory 
array 230. 

25 The process used by the system to locate 'a file when given the file's name 
is as follows. I=irst it uses the file directory to translate the file name into a tag. 
Second, it uses the primary tag directory array 230 to translate the tag Into a 
disk identifier and an index or pointer to- the appropriate item in that disk's file 
descriptor table 1 54. Third, if the tag was not found in the primary tag directory 

30 array 230, the secondary tag directory 240 is accessed to translate the tag into 
a disk identifier and file descriptor item pointer. 
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The primary tag directory array 230 forgoes disk space utilization efficiency in 
order to provide fast tag lookup, whereas the secondary tag directory 240 forgoes 
some degree of tag lookup speed in order to improve disk space utilization. 

5 Automatic Disk Monitoring and File Moving 

Taking advantage of the present invention's ability to move tiles transparently 
(i.e., without affecting the system's users), a human system administrator can 
periodically review the remaining capacity of each of the file system's disks, and 
then move files around to balance disk capacity, thereby preventing any one 

10 disk or set of disks from running out of room. The human system administrator 
could also attempt to periodically monitor the load on each of the file system's 
disks (i.e., the rate of input/output operations to the disks), and move files so 
as to balance the load on various disks. Load balancing can be important 
because each disk arm actuator of a disk storage device can handle no more 

1 5 than a set number of I/O operations per second, with a typical limit being perhaps 
25 I/O operations per second per disk arm. If too many of the most actively used 
files are located on one disk, system performance may degrade due to an 
Input/output bottleneck at that disk. 

20 However, once the ability to move files from one disk to another is made feasible 
by using the present invention, it also becomes feasible to automate the process 
of monitoring disk I/O loads and disk capacity and to automatically move files 
between disks so as to balance either disk loads or disk capacity or both. 

25 Referring to Figure 6, in a preferred embodiment, the file system has a disk usage 
monitor and balancing routine 164 which not only monitors disk capacities and 
disk loads, but also computes time averaged statistical values, such as the 
standard deviations of these two disk usage parameters (step 250). The standard 
deviation values for percentage of capacity available and disk load are indicators 

30 of disparity of disk usage. It is important for these values to be time averages, 
or low pass filtered, so that short bursts of disk activity, or a temporary process 
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that only briefly occupies a large amount of disk space, does not unnecessarily 
cause a large number of files to be relocated. 

When the disk load standard deviation exceeds a corresponding preselected 
5 threshold value (step 252). which is indicative of the dividing line between 
acceptable and unacceptable system performance, the program 1 64 moves files 
from the disk that is most heavily loaded to a comparable disk (i.e, in terms of 
access speed, availability, etc.) which is least heavily loaded (steps 254). 
Similarly, when the disk capacity parameter exceeds a corresponding preselected 

10 threshold value (step 256), the program 164 moves files from a disk that has 
much less than average remaining capacity to a comparable disk with the largest 
remaining capacity (step 258). Then the process starts over again. If the disk 
usage is still Imbalanced, more files are moved, until the computed disk usage 
statistics do not violate the specified threshold criteria for acceptable system 

1 5 performance. The number of files moved during any one iteration of this process 
should be kept conservatively small so as to avoid unnecessary file movement 

In other embodiments, other criteria than standard deviation of disk capacity and 
disk load could be used to determine when files should be relocated. 

20 

While the present Invention has been described with reference to a few specific 
embodiments, the description is illustrative of the invention and is not to be 
construed as limiting the Invention. Various modifications may occur to those 
skilled in the art without departing from the true spirit and scope of the invention 
25 as defined by the appended claims. 



. 13 - 



CLAIMS 

1. A file system for selective redistribution of files for use with a computer 
system having a central processing unit and a multiplicity of distinct data storage 

5 devices on which files are stored, comprising: 

a multiplicity of file directories, stored on various ones of the multiplicity 
of data storage devices, each of which contains a set of file directory entries, 
each file directory entry denoting a file name and a corresponding tag value; 
wherein multiple file names corresponding to a single file may be stored in 
10 multiple ones of i±ie file directories; 

a separate file descriptor table for each of the multiplicity of data storage 
devices, each file descriptor table containing a file descriptor entry for every file 
stored on the corresponding data storage device; 

a tag directory, stored on a selected one of ^ multiplicity of data storage 
15 devices, containing tag entries for a defined set of tag values, wherein every 
file stored in thB multiplicity of data storage devices is represented by a unique 
one of the tag entries, each tag entry denoting the one of the multiplicity 
of data storage devices on which the file corresponding to the tag entry is stored; 
and 

20 file moving means, operable by the central processing unit, for moving 

a specified file from a first one to a second one of the multiplicity of data storage 
devices, including means for copying the specified file to the second one of 
the multiplicity of data storage devices, generating a file descriptor entry for the 
copied file on tiie file descriptor table for the second one of the multiplicity 

25 of data storage devices, and changing the "tag entry in the tag directory 
corresponding to the specified file so that the tag entry denotes the second 
one of the multiplicity of data storage devices; 

whereby a file can be moved from a first one to a second one of the 
multiplicity of data storage devices without having to locate and update all 

30 corresponding file directory entries. 

2, A file system as claimed in Claim 1, further including 
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data storage usage monitoring means, executable by the central 
processing unit, for monitoring usage of t:he multiplicity of data storage devices, 
computing corresponding statistical values indicative of whether the usage is 
imbalanced, comparing the computed statistical values with predefined criteria, 
5 and when the predefined criteria are met, automatically moving files among the 
multiplicity of data storage devices so as to better balance usage thereof. 

3. A file system as claimed in Claim 1, further including 

data storage usage monitoring means, executable by the central 
10 processing unit, for monitoring unused capacity on each of the multiplicity of 
data storage devices, computing statistical values indicative of whether the 
unused capacity is unevenly distributed among said multiplicity of data storage 
devices, comparing the computed statistical values with predefined criteria, and 
when the. predefined criteria are met, automatically moving files among the 
15 multiplicity of data storage devices so as to more evenly distribute unused 
capacity among tzhe multiplicity of data storage devices. 

4. A file system as claimed in Claijn 3, the dat:a st:orage \3sirig 
monitoring means further including means for monitoring loading on 

20 each of the multiplicity of data storage devices, corrputing time ^ 
averaged statistical values indicative of \diether the loading is 
unevenly distributed among the multiplicity of data storage devices, 
conparing the oorrputied statistical values witix predefined criteria, 
and \*ien the predefined criteria are met, automatically moving files 
among the mul1:iplicity of data storage devices so as to more evenly 

25 distribute loading among the multiplicity of data stxsrage devices. 
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5. A method of allocating storage space in a computer system having a 
multiplicity of distinct data storage devices, comprising: 

storing on various ones of the multiplicity of data storage devices a 
multiplicity of file directories, each of which contains a set of file directory entries, 
each file directory entry denoting a file name and a corresponding tag value; 
wherein multiple file names corresponding to a single file may be stored in 
multiple ones of the file directories; 

storing on each of the multiplicity of data storage devices a separate file 
descriptor table, each file descriptor table containing a file descriptor entry for 
every file stored on the corresponding data storage device; 

storing on a selected one of the multiplicity of data storage devices a 
tag directory containing tag entries for a defined set of tag values, wherein every 
file stored in the multiplicity of data storage devices is represented by a unique 
one of the tag entries, each tag entry denoting the one of the multiplicity 
of data storage devices on which the file corresponding to the tag entry Is stored; 
and 

moving a specified file from a first one to a second one of the multiplicity 
of data storage devices, including copying the specified file to the second one 
of the multiplicity of data storage devices, generating a file descriptor entry for 
the copied file on the file descriptor table for the second one of the multiplicity 
of data storage devices, and changing the tag entry in the tag directory 
corresponding to the specified file so that the tag entry denotes the second 
one of the multiplicity of data storage devices; 

whereby a file can be moved from a first one to a second one of the 
multiplicity of data storage devices without having to locate and update all 
corresponding file directory entries. 

6. A method of allocating storage space as clainied in Claim 5, 
further including 

monitoring tisage of tiie multiplicity of data storage devices^ 
oortpiting corresponding st:atistical values indicative of vAiether the 
usage is imbalanced, oonparing the caiputed statistical values with 
predefined criteria, and lAien 
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the predefined criteria are met, automatically moving 
files among the multiplicity of data storage devices so 
as to better balance usage thereof. 

7. A method of allocating storage space as claimed in 
Claim 5, further including 

monitoring unused capacity on each of the 
multiplicity of data storage devices, computing 
statistical values indicative of whether the unused 
capacity is unevenly distributed among the multiplicity 
of data storage devices, comparing the computed 
statistical values with predefined criteria, and when the 
predefined criteria are met, automatically moving files 
among the multiplicity of data storage devices so as to 
more evenly distribute unused capacity among the 
multiplicity of data storage devices. 

8. A method of allocating storage space as claimed in 
Claim 7, further including 

monitoring loading on each of the multiplicity of 
data storage devices, computing time averaged statistical 
values indicative of whether the loading is unevenly 
distributed among the multiplicity of data storage 
devices, comparing the computed statistical values with 
predefined criteria, and when the predefined criteria are 
met, automatically moving files among the multiplicity of 
data storage devices so as to more evenly distribute 
loading among the multiplicity of data storage devices. 
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